Searchable Metaspaces 1 Overview of Objectives
Abstract
The purpose of this presentation is to start a discussion about methodological and operational requirements for developing tools for internet browsing and/or querying of meta-descriptions of language resources, in particular multimodal corpora. Among the most important requirements are: delimiting the relationship both between meta-descriptions and the resources they apply to, and between browsing and querying over the internet; establishing a standard for representing meta-descriptions; administering the web-based availability of language resources and their accessibility via meta-descriptions; and establishing user support for query editing and data interchange. We attempt to stake out positions regarding these requirements, addressing both their advantages and disadvantages. We base our positions on the EAGLES/ISLE proposal for a meta-description standard for language resources ([1]). Our views are also influenced by work on the development of query languages for linguistic resources, such as CQP, the MATE query language Q4M, and the TIGER query language for syntactic tree annotations.
With the increasing development and use of multimodal language resources, there is a growing need for suitable tools to access and query these resources. To facilitate these tasks, it is common and essential for the resources to be associated with meta-descriptions of their content (including object data annotations). Two perspectives are possible and relevant in this context:
The local or site perspective: an institution has one or more multimodal resources, and somebody wants to identify parts of these resources that satisfy certain meta-descriptions. Possibly one even wants to retrieve, from such resources, certain subsets (say turns, sentences, whole dialogues), according to a combination of metadata and linguistic (or other modality-specific) criteria annotated in the resource. The resources are accessed locally and the search is also carried out on site.
The global or web perspective: somebody wants to know about the existence of resources of a certain kind (i.e. satisfying certain conditions in terms of meta-descriptions); if a web search engine accepts the required meta-descriptions, then the resources can be located by browsing, and possibly even accessed and queried over the web.
Although these perspectives make different demands on implementation, we will see that, from the point of view of the language resource user, they complement rather than compete with each other; hence, resource owners should accommodate both perspectives. Following the EAGLES/ISLE